XbaI (-6372)
-6373 TCTAGAGTCAGGATGGCACTGAAGGTCTC 

-6311 ACCCTCCCCCCTTTCCTGGGTGATAGAGAAGACTCAGAACTTCACGCC

ApaI (-6233)

-6249 CCCTACCTGCAGCCAGGGCCCGGTGCGATGAGAGCCCCCAGACCTCCCTGAAGGGTGAGTGA

-6187 GTGTCACAAGTGCCACATGCAGCTGTTCTGCC

-6125 GCCCGCCACACCCCAC^GACCTGGAGCAGAGAGACAAGAAGGCCCTACGCTCAGACACTC
-6063 CAGGCTAGGC CAATTAGGATGCCCAGGCAGGGCTTATGAAAAAGGAACATGGAAAGGAA 

HindIII (-5985)

-6001 CCAGGGTGCCCTAGGAAGCTTAAGAAAGAACGCTGGAGCCAGATGCTTGGGTTCCAA 
-5939 GCTG<^CCACTTCCTAGCTGTGTGACCTTGAATCAAATCACATTATCCTA
-5877 TCCCCCTTCTGTAAAATGGGCATCATAATGTCAGTGCCTTCCTC
-5815 GACCACGGGAGGCAATGXIAGAGCATGCTCTCGGCACAGTGXIC 
-5753 AATGGCATCATCTCACCAGGCCTATCTTGGGTTGRGTGGGCTG 

BamHI (-5667) 

-5691 ACTGCCATTGGAGTCTGAGAAGCGGATCCTGGTAGGGCGGTC 

-5629 GGGTGAGGCCGGACTGAGCCAAAAGCAGCCCCTCCCAGCTCTCCCAGTTTCCCTCCSGGCCC
-5567 CGGCAGCGTGACCCCTCCTTGCTCCTrcCCCTTTCTCACCGCCTC

-5505 GAGGCTAGAGCGCCAGCAGCGAGACTCGGCTCGTGCCACCGCCTGCGACCTCGGCCCTGT^ 
-5443 GCAGCGCCACGAAGTCTGGGACGGGAGGAAGATGGCCTGAGCACTGTCAAACGC 
-5381 TGGCCCAGCCTCAACCACAACCCCGCTGTTCGCCAGCCCCCTACCCGTGTGGCCGTCACCAC 
ApaI (-5318)

-5319 GGGCCCGCTCCTCAGCGCCIGGCTCCCCGCGGTCGCT 

-5257 GGATACACGAAGGAGAGGCCGCTCGGCTGCCGCTCCGAACTGCTGCGCTCTGCGGS 

-5195 GTAAGAACACGGGCTTCAGCTGGCCATGGGAAAGGCCAGTCCGACGCCCCATCCAAGTGGCC
-5133 CGGGACCTAGTATCGTGGCCCTGCCTCCCTCCCCGCAGCGGAGCAAGACTTACCCTGGGGGC

-5071 AGGTCTGGCAGCAGTGTCCCGGCAGCTGGCGCGG^

-5009 GX3TTTGATGTTCTTGCAGCTGACCCTGCCAGGC

-4947 TCCCGGAAAAGGCGGGAAACCCAAGTGAGTGCAAGATGCCAACTGATGAGACCCCCCCAGGC 

-4885 AAGGATCTCCCGCAGAGTCAGCCAGCTCTGCCACTTACAAGCTGCG

-4823 ACTTGATCTCTCTGGGCCTCAAGGTC CCTGTCTGGAAAATGGGGATAATAATACTCTCTATC 

-4761 TAGCAAGGCTGCCATGAGAGTTAGATGAGCAGGGAACGAAAC^GAGTTGGCACAGAGCCTCA 

-4699 CACAGAGTGGGCGATCAGTAACAGCACCTAAGAATTC 

-4637 CAGAAAAATATCCCCAACATCTGCCGACTGGGCTCCTTCTCAGCAGCTC 

-4575 ACGCCCGCGCGACCCGGCCGTCCCCACCCGCCAGCCCGGGCCGGCCGCGGGGTGCACTCACC 

-4513 GCCTCGCAGGCCACAGCACGCAGCGCAT^^ 

-4451 GTCTCGTCCAAGGCATAGACCTTCCCGCCGAAGTGCAGCCTGCGGGA 

-4389 GCGCI^CCAGCTCGCGCCGTGTGCCGCCCCGG 

-4327 ACCGCGCCCTTCTCCCCCCGCCCACCCTCCGGG^^ 

-4265 CGCGCCCCTCCTCCGGCTCGGC^CrrCGCCCCGAGCCCGACTCCCCG^

-4203 GCGCCCACCTACCCTGCTGCCCGAACGGGCAGCGGCTC 

SfiI (-4079)

-4141 GGGGGCTCTCGGGCCGCGCGGGGCGGGAGCCC3AGCAGCAGCAGCCCGA 
-4079 GCCGGCGGGGCCGGGAGGGCHCGGCATGACGCGAACGGGACAGCTGGGGAGGAGGGAGGGAG 
-4017 GAGGGCGCGGGAGCGGGCGGAGGGAGGGAGGCGGGAGTGCGGAGGGCGGAGGGCCGGGCCGG 
-3955 GGGCGGTGCGGCGGGAGGGGGCCGGGGCCGGGGCCGGGGCCGGGGCAGTGCCCGCGAGGGGC 
NotI (-3885)

-3893 TCGTCGGGCGGCCGCAGAGTCGGCGCCGGGCCGGGCGGGGCGGGAGGAGCGGCCGGGAGGAG
-3831 CGCGGGCGGGCGGGCGCTGACCCGGGCCGTACGCGGCTCTACTGCCCCGGGCGCCGGCTCCG 

-3769 GCCCGTTTTATGCCCCGCGCCCGACGCCCCGGCCGGGGGCCTCCTCCTCAGCAAACGGGGCG
-3707 GCGGCGGCGGCTCGGCGAGGGGCCGCTGAGCCCGGGGGGTCCGACCCAGCAGCAGCGGCCCG
-3645 GATCGCGGGTGGGGGAGGGGAGGGAGGGCTGGGACCGGGCAGGGG^ 

-3583 GGGGAAGGGGGAGCGGGGGAGGGGGAGGGGAGGGACCAGGGGGCGCGAAGAGGGGGAGGAGA 

-3521 GGCGGCCCGGAGCCCCCGCTGCTGGCGGCCACAGGGCGGCTGGACCAGGAGGTC

-3459 GCCCAGGAAGGGAGCCTCAGGCTAGGGAGGGGa^GAGG
-3397 GTCAGCGAGGCCCGGTTCCGCCCGAAGGATAAACTTGTCTTTAAA 

ApaI (-3307)

-3335 GGTCC^TCAGCCGATCTCCCCCTSCCTGGGCC^ 

-3273 TCTCACTGCCTAGCCTGCCTCCCTACCTTC^ 

-3211 CAGRACAGGGACCTAGCCAGAAACCGGCAGCATTCCCCCTTCTC 

-3149 CTCTGATTGTAACTTATCCTCAGGCGCATTCGACAGTCC^

-3087 CTTCACCCAAGCXSACCCTCTCCCTCTCC^^ 

-3025 GGTCATGCCTGCCTCCCTGTCTCCTGTCTC

-2963 TATCCCAGCACCCTCCTTCCTAATCTTGGGAGACATCrrc 

-2901 AGGATCTAGGCCACACTTCTCAGCAGACATGCCCATCC^^ 

-2839 CCTGAGGAAGTTCTGGGGGACAGGGGGATGATGGGATCAAGGTCAGGCCAGGA 

-2777 GGACAGAGACTGTGGGGAGACTTGGGACTGGGAAGAAAGCAAAGGAGCTAGAGC 

-2715 AAGGAAAAGGGGGGCCAGCAGGGWGGTATTTGCGGGGGAG 
-2653 GACAGGGACACATGGGCCTGGTTATTCCTCTTGTCACATC

ApaI (-2568)

-2591 CGGAGACAGAAC^GCAAAGGAGGGCCCTGGGCACAGAGGTCTGTGTGTGTAGCC^TCTAAG
-2529 CGACTGGACCCCAGCAGACGAGCACCTAAGCTCAGGCTTAACCAGTC 

-2467 ACTGTCCCCCGCACCTCACGTCC^CTCAACCC^

-2405 ATAACAGGAGATTTCTCTCATGTGGGCAATATCCGTGTTCCCACTTCGAAAG 

-2343 AAGATAGGACTC CCTAGGGGATTACAGAAAGAAAAGCAGGAAAGCAAGCATCCTGTTGGATT 

-2281 TCAGCAGCAGGTATGATGTCCAGGGAAAAGAAATTTGGATAGCCAGGGAGTGAAAACCCCA^ 

-2219 CAATCTTAAACAAGACCTCTGTGCTTCTTC^ 

-2157 CTGGAAAAAACTTCTGCTCCTGTCCCCCTCGAGGTCCAGGTTC 

-2095 GGATCCCCC^ 

-2033 CCAGGCAGGGVGCTCCAGGGAAC^GCAAGGCGTCACTTCCGGGSGCCTT^ 
-1971 TGGCTCCCITCTCTGATTGGGCAGAAGTGGCCCAGGC^ 
-1909 GGGGCTGTGCCCCACCGCCACATG 
-1885 tcttcctacc^ & TPTc^TCcrp AC?AGGGC' r ry^rT^(Tr^T<y i A CTTOGGT^CTr^^AGCCCTTC

-1823 TCCACCCGG TGAGTGGCCAGCAGGGTGTGGGGTTATGTGAGGG^ 

-1761 AAATGGGCTCCGAGCTGGGGGAGGGGCAGGCAAACTGG 

-1699 GAGAAGAGTGTAGCCTTCCCAGAATGGGAGGAGCAGGGCAGAGCAGGG^ 

-1637 GCTGKTTTCCTGAGGGACTGATCACTTAC^ 

-1575 AAGGAAAGGGGACATGAGCCCAGGGAGAAAATAAGAGAGG 

-1513 AACACAGTAGTAAGATCGACACAGCCCC^^ 

-1451 TTAAGGTTCTGAATCTGGTGCTGGGGAAGCTGGGCCAGGCAAG^ 

-1389 TAATGGGAGGAGGGCCG2S 

-1327 G^Gc'cTCTrTGCACAACTTCTCA^ 

-1265 ACCGGAAGGGGTTCTGCCAAGGGGGCAGG 

-1203 GTGGGCGCCTAAGACAAGGTAAGCCCCTAAGGTGGGCA 

-1141 GGCAGCTGGTTTCAGGAAGGAAGTCCCAGAACTGTTAGCCCATC^ 

-1079 GAGTATTTCAGGACTTGGAGTCCAGAGAAAAGCTC 

-1017 GGGAAAGAATAGAGGTTAATTTCTCCCATACCGCCITTTAATC 

-955 GTTACAGCITIGTCCAGr^^ 

-893 CATATTGCGCCCGTTTGCCAGTTCCTCACCCAGGCCCTGCATC 

-831 CCAGGCTGAAGCCACAATACTTTCCTTCTCT^ 

-769 ACCAAGGTTGCTCAGAATTTAAGGCTAATTAAGATATG 

-707 GCTCTCAGCAGGGGTAGGTGGCACCAAATCCATGTCCGATTCACTGAGGAGTC 
-645 AGGAGACACCATATGCTTTCTTC^

-582 ACGGAGTTTCACTCTTATTGCCCAGGCTGGAGTGCAATGGTGCGATCTC

-519 TCCGCCTCCCAGGTACAAGCGATTCTCCTGTCTCAGCCTCCCAAGTAGCTTG^

-456 TCAACCACCACACCCTGCTAGTTTTTTTC

-393 AGGCTGGTGGCGAACTCCTGACCTCAGGTGATCCACCCGCCTTGGACTC

EcoRI (-268) 

-330 TTACAGGCATGAGCCACTGCACCCGGCACA

-267 ATTCAGGGCTITGGCAGTTCCAGGCTGGTCAGCATCTCAA

-204 CTGCCAGGCAGTCTCTTCCTAGAAACTTGGTTAAATGTTCACTC^ 

-141 mnnm cKXxr^

-78 accczcrrrc. o ATGc^ctT 1 a ggaag g s rrrry ^ Afyyy? AGA<yyrr^ ft ft AC AG<^ A GCCACGCCAGCC
-15 a r. a r a rrrrr^CAGA U ATC 1) GAS aSSCES GTGAGAACACACCTGAGGGGCTAGGGCC 



43 ATATGGAAACATGACAGAAGGGGAGAGAGAAAGGAGACACGCTGCAGGGGGCAGGAAGCTGGG 
106 GGAACCCAI 



^TTCTCCCAAAAATAAGGGGTCTGAGGGGTGGATTCCCTGGGTTTCAGGTCTG 



169 CCTGAATGGGAATTCCTCGAATAC^ 

232 TCTCCTCATCTAAGAA H£ ffi CE 2E S£" fiE ^ ^ ^ ^ 

281 ^m^m^^^^^^ mM2a££2:effleE 

329 m^Masmasr^m^^^^^^^^ 

377 AGAACTCCCAACATTATCCCCTTTATCCGCGTAACTGGTAAGACACCCAT^ 

440 CACCATCACTTCCTCTAACTCCTTGACCGAATGACT 

XbaI (562)

503 GATCACACTCTCTGACAAGGATTATTCTTCAC^ 



566 GAACT 



FIG. 5B 
HindII (-4511)

-4512 GTCAACCTTCACAGTAATTGCTTGTTCAGTGACTGCCACAACCCAGCCTGGCAGAGAGAGGGAA

-4448 GATACCCTATAAAGCAAGGTAACGTTAATGTTGAGACCATGAA
-4384 ATCATTGCTTCCTTCAAAATTCAGAAGGATCTGAT^ 
-4320 CGTGCAGGTTCAAACCACAGCTCTCGTCCTTCCGCT^ 
-4256 TCCCTGGCCTGCACGTCGCTGGGTCACACAACACrcGCTGTC 
-4192 AGCGTTCCTTTCAGGCCATTTCTTTCCA^ 

-4128 ATTTCAGCCAAATCAGAGCATG^

-4064 TTTCCCTTCCTGTGTCTGTGACAGGTTTAATTTAACCTGAGA^
-4000 GGTGTAAAAAGACCTCATCCTGATCTGAGAAGX^GGTCAGCTT^ 
-3936 GCCAGCACCCATTCTCTGTGGATGTGAAAATCCCAGAAGGGCTGG 

ApaI (-3851)

-3872 CCAGGCCTATCTCCAGAGTGGGGCCCAGCATGGGAGGATTG 

-3808 GGGGCTTGGACCTACAGCTCGAGAGCACCCATGGAATGTGGGCAGAAGCGACAGCA 
-3744 CCGCCTIGGCCTTAGGGCGGCACGTGTTCTGCTTGTGCCCTGGGAGCCTCCACCTTC 
-3680 TGGGAAGAGGGTGCCCAGGGAGCTGCAGTCTCTCCAGCCCAGCCCCAG 
-3616 CAGAGCCACCCCAGCAGACCTGGCAGTGTGAGAGAAATGCATGTGTATACACT^ 
-3552 TGGCTGTTACATGGCAGCATTGACTGACACAGACAGAAAACIAGATCC^ 
-3488 GTGCTCGAGACTCCAAC^GCCACAGGCTGCAGGGGCAGGATGGCTTCTTA 
-3424 TGTTCTGGGAATCTATCAGAGGAAGACATAGAGGCTCCAGACGGTTGA^
ApaI (-3353)

-3360 CCAGACGGGCCCCATGTCAGACCAGGCTCCTCCAGGGCTGTCGCTGCCCTCACCAAAGCCCGTC
-3296 CTCAGGGCAGCC^CACAGCAGGCAGCACTCGCCATTTGTACAAGCGAG^
-3232 TCCTTCTGGCAGGTAGAGGAAGCAGGGGCACTATGCCTGGGAGTTCCTG

EspI (-3109)

-3168 GCATTTGGTCAAGAGCCAGGAGGG^

-3104 CACACGTAGGGGGTTGGGC^CTTGCTCTGTGAGCTA

-3040 GCTGCACCAGGCAGTTTCTTGGTGGAGGACAGTGGCCGCCCTCTA

-2976 GATGGGTCGCTGTCAGATGTGTGTCCAGGAAAGGCAAACACCAAAGGCA

-2912 CACCGAATACTCCGGTGGCCTTGCTTGGGGGCTGGGTTTTGA 

-2848 TTAGAGATTAAAAACAGGGAAGAACCATTGCTGAAACCTTTGGA

-2784 GCAGCCTGAGGAGTGGTGGTGTTTCCATCTGGTAGACGCCGTCTCAATA 

-2720 TGCACC^GTGCTGCCAGCCAGAGGCGTCTGTTGGCGTGTCT^ 

-2656 GTGGAGGGTGGTTTACCTTCCTGTTTCTAGTCCCCA 

-2592 TGGCCAGACCGAGCACITrCCTGACTTTCGACCTTGGCCCC^ 

-2528 TCCTGCAGACCCCATTTGTATTCATTTCCTGCAG 

-2464 GCCAACCGTTCCAGGCCCTCCTCCCAGGGGGACCACAGATGCTACGTG^
-2400 GGGCCAGCACAGCCCCTTCCAAGTGGGCAAGACCCAGG 

-2336 AGCCCTGGAACCTCTGAATGTTGATTTTTGTAGC^

-2272 TGTTGAGATAAGGACATCCTCCCTGCTCTCTGGGAGGA 

-2208 AGAAGAAGAGGCAGAGACTGGGGTGATGCAGCC^CAACTAAGGA

-2144 CCTGCAGAAACTGGAGGGCAAGGAGCATCCCCCAACCGCCCGGAGCCTCCAGGAGGCGCAAGGT 

BamHI (-2032) 

-2080 CCTACTGACTCCCTGACTTCAGACGTCCAGTCTCCGGAATTTTGAGA 

-2016 TTTAAGCAACCAAACTTGTGGTAGTlTCACCAGTCTCA 
-1952 AGATTCCAJVGAAATGAGTGGCGGGGTGCGGTGGCTCACACT^

-1888 GATTGCTTGGGCTCAGGACTTGGAGACC

-1824 CGATCGTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCAGGCGGAT^

-1760 GTTTGAGACCAGTGTGACCAACATGGTGAAACCCTGTCTCTACTAAA

-1696 GGTGTGGTGGTGCGTGCCTGTAATCC GAGCTACTCGGGAGGCTGAGGGAGAAGAATTGCTTGAA 

-1632 CCCAGGAAGCAGAGGTTGCAGTGAGCCGAGATAGTATTACrcCACTCCAG^

SphI (-1509)

-1568 CAAGATTCCGCCTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA

-1504 ACCTGTGGTCCTCGTACGCCGGAGGATTGCCTGAAGCC^GGAGTTC^

-1440 AGAGCAAGACCCCATCTCTACCAAAAAAATTTAAAAATTAGCCAGGCATGGT^

-1376 GTCTTAGCTACTCAGGAGGCTGAGGAGGGAGGATTATCTGAGCCTGGCGGTTG

-1312 AGCCATGATTTGGCCACTGCACTCCAGCCTTGGCAACACAGTC

-1248 ATAAAAACCCAAAACAAAAGAACCAAGAAATTACTGGA 

BamHI (-1162) 

-1184 CTGCCCTKTGACTGGTCACTCGGATCCCTGGGCCTAAACACACAGCCTATTGTC

-1120 AGGCTCCCCACTGCTTGGCTGGCAATTGGGGTGGCTTTGC^ 

-1056 GGCGCTGGTGCTGCAGGCCCCCACCACTGCTTGTTCCGAGCTCCCCAGCCTC

-992 CTGCACCTGATGGCGATGAATCAGGAAGGCAGGC 

-928 CAGCCACCAGGGGGCTCCATTTGCAACTTTGGATGTGGCTTTGGCCTC 

ApaI (-860)

-864 TTGGGGCCCCCAGACAAGAGACAGGGAGACTGGAGCCCAGCCCCACCCTCCCGCACATA

-800 CCCATCCCTGCCCTATCCTGGAAGATGGGGGCCACCACACGTRCAAGGGACACGGGATAGGAAC
-736 CTTTGGCCTTGTTATCAGACATTTTAAAACTAAGTGCAAAC
-672 CAGCAGCAAGAAACCTGTGCTTACAGAAAGAAACACGTGCTAGCAACCCACCTATGCGGAAAGCC
-607 ACACAGAGCCATTCTTTTCTGCACTCTCAGGTGACGG^ 

BamHI (-498) 

-542 CTGCCTGAACTTTTAAAACTCCCAGAGACGCACTGCCT 
-477 GCCAGGG 
CAP site (-469)

-470 CCTTGAAGTGCTTCTTCAGAGACCTTTCTTC^ 

-408 AGGAGAAAATTGTCATCAAAGGATATTCCAGATTCTTG^ 

-346 ACATCACCATCATCTCAG GTGAGCACCAGiGT^ 

-284 GCAGGGAGGGAGGCITAGAGTCTCATCCTCCAGCAGCGA 

SmaI (-220)

-222 TCCCGGGCGGGTTTTCTGGTGGATGGAGGAGTGACTCGGGGTCCTCT 

-160 TTTGGCTTTCTGGACGTTGTAGGAAAGGGTTTCCCCCGCCTGCGTCC

-98 CCACCAGCCCCTGCGAGCTGGGCTCCAGAAGGCTGGAGTGCTGTGGC^

AUG (1) 

-36 CTTCTGTTATGTCTCTGTGCCCTGTGCTCTCCCAGG ATG AGG GGC ATG AAG CTG 

19 CI£ SSS G£S CT£ STS GCA. CIS SSS GCC. Cffi CJS £A£ SSS GCS SIS 

64 TC£ CIS MS ATS GCA S£S ITS MS AES SAG A£A TH SSS GAS A££ 

109 AAS ATS TC£ AAT. GCS ACS □£ SIS AGS IAS ATJC GTS SAS ATS SIS 

154 ASS CQS THL GAS. Ml GCS CTS STS CAS GAG. SIS AGA GAC. AGS CAC. 

199 CTS AST SSS SIS GGG AAG CTG CTG GAC. AAC. CTS AAT CAG- GAT SCA 

244 CCA SAS ASS TAT CAS TAS SIS STS AST SAS CCA CTS SSA SSS AA£ 
289 ASS TAT MS SAS SSS TAS STS ITS STS TAS ASS SST SAS SAS STS 
334 TST SSS S 
-8711 AGCTTCTGCTTTAGGAAAGTAGAAAAATAAGAGCAAATTAAATCCAAGGTAAGTA?^AAAAAAAAA
-8646 AAAAAAAAAAAAGAAATAAAAATTAGAGCAGAAATCAATAAAATTGAAG^ 

-8581 GAAAATCAACATAAAAAGTCTGGTTCTTGAAAAGATATATAAAATTGATAA

-8516 TAATTAAGGAAAAAAGACAGAGGACACAGATTACTAATATCAAACATAAAAGC

-8451 GCAAATTTTATAGGCATTGAAAGCGTAATAAAAGAATACTATAAACTATTCTAT
-8386 TGATAAGTAAATAGAATGAACCAATTCC1TGAAAGACATAATCTGAAAAATGT 

-8321 TAAACAATCTCAATAGCCTATATCTA^

EcoRI (-8223) 

-8256 AGGAAGCACAATGCCCAGATGGGTTCACTAGTGAATTCTATC^
-8191 GTATCAACTTTCTACAATCTCTTTCAGAAGAC^ 

-8126 CTAGGCCAGCATTACCTTAATACCGGAACTAGAA 

-8061 CAATATCTCTCATGAACAAAGATACAAACATTTTCA^ 

-7996 TGTATCAAAAAATATACACCACAACCAAGTAGAATTTATTC 

-7931 TTTGAAAATCAATTAACGTAATTTGTCCCATCAACAGGTTAA^ 

-7866 TGATAGACACAGAAAAAGCATTTGACAAAATTTAACACCC^ 

-7801 CTAGGAATAGAGGAAAACITCCTCAGCTTGAATCTA^ 

-7736 AACTCCTCTTAAAAAATAAAGTTTTTCATTT^ 

-7671 GTATCTCATTTTAGACCAATCAGCTATGGATAGTTAGGCGACAGC^

-7606 TCTITCTXK3CAATGTTCCAGACTACATTTAAAAAA 

-7541 AAGAAAAATATGAAAATGCTTTGCCGTGTTAATGCTACTCTTT^ 

-7476 ACTITATTTATATTTCATTAGTTTTTTAGCT 
-7411 ATGCCACATTACATATAATTCTCATGTCTCCTTGGGTTCCrcTTAG
-7346 TTTTCTTATTTTTGATGACCTTGACAGTTTTC 

-7281 TAAAGTATATTTGTCATGATTTATACTGG^ 

-7216 TCTCATCACATCATATCAAGTTATATACCATCAATATTGCCACAGATC 
-7151 ATTTCTCTAATTTAGTGTATATGCAATGATAGTTCTCTGATTTC^ 
-7086 TAATGATTATTTAGAGTTTCTCTTTCATCTGTTC 
-7021 TGTAAGACTTCTTTTTATAATCTGCATATTO 

-6956 CTGTCATTCTATGGCCTGACTTTTCTTAA
-6891 TGCAATCTAAITAACAATCTTTTCTTTG 

-6826 ACTGAAGTCATGATGGCATGCTTCTATATTATTTTCT 

-6761 TTAGACTTATAATTCACTGGAATTTTTTTGTC 

-6696 TTTACATATAAATATATTTCCCTGTTTTTCTAAAAAAGAAAA^ 

-6631 AATGCCATATTTTTTTCATAGGTCACTTACATA

-6566 TTTATCAGCCTCACTGTCTATCCCCACACATCTCATGCTTTGCTC 

-6501 AACATTCTTTCCCATTITGTTCTACAAGAA 

-6436 TTTTAGAATGAGGTTGGCAAGTTAACAAACAGCTTTTT^ 

-6371 ATGTGAAAAGAAAGTATACCTTCACAATATTAAGTCTTTTAGTTCATGAATATACT 

BglII (-6248)

-6306 CGTTTCTGCATTAACTTAGACATTCATTAATTTCTCTCA 

-6241 ATTGATTTAAATCTTCACTAACCTCTCATTT^ 

-6176 CTTCTTTGCCTAGATTTATTTCCAAGTAGATTATTTTC^
-6111 AATGTAATTTTI^CCTTTTTATTGCTAAAG^

EcoRI (-6032) 

-6046 TGGCGACCTTGCTGAATTCTAATCGTTTATCTATAAAT^

-5981 AGATCATCTGCATATAATTITTAAAATCTGA^

-5916 AAGAAAATATCCAAATGGTCAATAAACATATGAAAAGATGCTGA 

-5851 ATGCAAATTAAACTATAATGAAGTATTATTGTACAACAATAGAATGACTGAAATTAAA
-5786 ACAATATCAAAGTTGGCAAGAGTCTGATACAACTGGAACTTCT^ 

-5721 AAATTGGTACAAACATTTGGGAAGTCATTACAATATTATC

-5656 CTATGAGCCAGTTACTTCATTCTAGGCATATACCCAAAAGAAGTATGTACTATTGTGCAGTA

-5591 AATACAGACAAGGAATTTCATAGGAGCATTAATTATCATGGCAAATATTT^
-5526 AGTAGAAGGGATAAAACATTGTGGTATACTTCTAAATAGGGTAAAACACATTAATC 
-5461 AAACTATACACACAAGATAGACGAATTTCGCAGACATTCTGT^

-5396 CAAAGCTCAAAAACAGACAGAATCTAGAGTGTTAAAAGACTGCAT^ 

-5331 AAAGTAGTGACGAGAGAGAGGAGAGAGAATAATGATTGCGAGGTGCTATAGTCTGAAGGTTTGTC

-5266 TCCCCCAAATTTCACATGTTAAAACCTAATCrc

-5201 GTGGATAATTAGGTAATGGAACAAGAGCCCTAACAAATGGGATTGGT^

-5136 GAGCCTGAGGGACCTTGTTTCCCGCTTCTACCATAT^
-5071 GAGCAAGCCCTCATCAGACACTGAATCTGCTAGGGCCTTAGTC 
-5006 CTATAAAAAGAAATGCTTGTTGTTTAAAAGGCATTCAGTCTATCGGTC 
-4941 CAAGAGACTTAAGAGGGAACAAGAGGGCGATTTCTGTTGTG1TGA 

-4876 CAAAGAGTGCAGACGTTTTTATTTTATAACAATTCATTGAGCT

FIG. 14C 




EcoRI (-4759)

-4811 TTTTCTATGTATATTATTGTTTTATAAAC

-4746 ATGTATTAATCTCTTATGAAAGAGTTTGTTGGCTTCCC^ 

-4681 CTTCATTTATGTTCTGCCACTGCTTATGCACCT^

-4616 TTCTITGATAGGGACCCTCTTCCTTGAAAAATAATTGAAGATATAAGGAO 
-4551 ACTAAATGTTTATTTCTAGATACATAGTAGTCTGCATAGA 
-4486 AATTGGCTCCTATCTCTGAAATTTATAGAAAAGCATTTCCAC^ 
-4421 GAAATGTCATTCAAGTTTTACTTTCTAAATGTCAC 

-4356 GAACTGGTGCAGGGACTGGAAGTAGTTTTCTCATACAACGGAAAGTTAACGAG

-4291 TGTCTCCAAAAATAACGTCCACAGAAGGGACAAATAACAAAGGGAAAGATGAC^
-4226 GGGCACTAACCCTTACAATGCAGATAGACAC^ 

-4161 CAGAAGGTTAAATAAATTTTCCTGGTTATTCTGATACAAC^ 
-4096 TAAAACTTAAAATGATGTATTTAAAAGGAAGA 

-4031 AGATTACTACTAATCCTCAGGAGAAGGGGTAGAGGAGAAACTCCATAAAGGCAACTGGA 

-3966 GTATTAGGAAGCACCTCAAGAACACAATAGCAGGAAGTAGCTAGAGAACAAAGAGAAGAAA^
-3901 GAAAAAAAAAATCCCTTTTTATTITrcTGTrrcCATO 

-3836 CTTTATTTTCACCCTCCACAGCCATGAGAGC CTCTGGGCAGGAGTTCTCCTCGCCTCTCCCTGTT 
-3771 CCAATCACCTCTAACATTTCTGCCTATTGTTCTGCCCAG^ 
-3706 CAAAGACCTCTTGAATTAAGTCCAAATGCTACACTCTGG 
-3641 CCTGACTTTTCCACCCTCAGCCTCCTTGAT 

-3576 TGTCTTTGCTCCTGCAAATTTGTTCATTCTCCT^

FIG. 14D 
-3511 AGCTGTGGATATCATGGTATCTATTGTCTATCATGCTA

-3446 ATTTTATATGGTACTAGTCTAAATTGACATCTACTGTC 

-3381 AGACTGTACACAAAATTTAATTATCTCATGAATA^ 

-3316 TATTTTGGATATACTATGCTAAATAAAAC^TATTATTAAAATTAACT^ 

-3251 CTTTCAATATGGCTACTAGAGCTITITAAA^ 

-3186 AATGOXOTttCC^^ 

-3121 GGCACACTGCCTGCATTAAGGGCAATGAATGCC^ 
-3056 TTTCTTTGAGAGCX^TCATCACCa 

-2991 AGACTGCTTGATATTCTACAGQAAAGATCACAGTTTTAAGI^ 
-2926 TCTCTATCTirCACACATTACACAGCCTCTCT 

-2861 GATAATAACCCATCTCAAATGTTTACTATGAGGATTATTCAAAGAATGG 

-2796 AATAAATGATAACTAGTACTACCGCCACTACTC^ 

-2731 AAGGACCATTTCCGGATGGAGGATAA^^ 

-2666 CACCTGGAAAGGTCAACTATATACAAGCCTGCAAGTC^ 

-2601 GACTCTATAGACTGTCTCCTCTITCCTGA 

-2536 OTCCTTCCATIGOTTn^^ 

-2471 CAAAACCCCAAGGAATTACTCAAATACTCACAT^ 
-2406 TTTTTAATATTCTGAAACTCATTGTTTTTAAAATGC^ 
-2341 GGCCTGCAAAGCGAAAGGCAGAGAGAATGAAACCCATAGAGA 
-2276 GACTCGTTTATTTTATAATGTAAATTAGTCT 

FIG. 14E



-2211 TGGAAAATACAAAGAATA^ 

-2146 ACTATTAAAATGGTGGTTTACTTCCTTTTATTAATTTTCTC 
-2081 GTATGTACAATTTTATGTTCTATTTTTGAAT^ 

-2016 AAATAATATATGCTCATAATAGAACATTTTAAATGCAAATAAAACAAAATAAATC 

-1951 GTAATATTTATTAAATTTTCTCCAAGTGCACGA 

-1886 GCCTAATAACCCTATTTCCAGACCTCTTCTC 

-1821 AAGGTATGAAGTGAAAAGATAAAGATTTTTCTTCCTTGCIGCATC 

-1756 CCCCAGGGTAACTACTATTAATAGATAGTAATTCTACCCAAAGGAAAAAATCATATGCATATA 
-1691 AGCATCATATGTATACCTTTCTAGTAACTTACAAAACAAATGATAATA 

PvuII (-1580)

-1626 TGTATTGCTCITTTCACTAAATGTATCTGTGATATGTGTCTATATC^ 

-1561 TGGCTGAATAATATTCCATCTTGTCCACGT^ 
-1496 ATTTGTCTTTGTTACTATGATAGTAATA^ 
-1431 TACACATGCACATACACATGCATATTTCTGCAG 
-1366 TGCAAGTTAAAGGAACAATCTCATTGCTTGAAATTTTAAAT^ 
-1301 TGGTCTCTCCnTGTAAGCTAGTTTGGGCTTTCTCACAG 
-1236 TCCTGGCCAAAGAGCAGAGTGCCACAGACCACAACTGCT^ 
-1171 TCTCTTTTTTCTATTAATAACTTTGTA 
-1106 GTAAGAGJCTTATTTTTCTGAACCAGGAAGTGGTTCA 
-1041 CTCTTCTGTTAGCTTTTGTGAAATGGTCAAAAACATAGCAG^ 
-976 ACCCTGGTTCGGCCTTCTCTATCCTTGT^ 

FIG. 14F 



-911 TGTGCAGTTTCTCCTCTGTGTAGGATCA^ 

-845 CCTAGACTATTCGAGTGCCTTTCAGAAGTT^ 

-779 GGCAAAACTCCTTGCAGTTTCAGCTACTATTC 

-713 CITTCAGTATCCAAAGAAGATTGGTTCTAGGACCACCATCCC 

-647 agaggat<x:tcaattccctcttataaaacgtigcagtait^ 

-581 gtatatittaaatcatccctagattacttataatacctgatacaatataaatg^ 

-515 acactgtatctttaaaatttacattattt^ 

-449 aaatatittccatctacagtcagtagaatc 

-383 gtatcttttagtgttttgaggttcttg 

FIG. 14G 
-356 AATTCTCAGGTCGTTTGCTTTCCTTTC

-291 TTCACTGAAACTTTAAAAJ^CATTAC^ 

-226 TATCATAAGATAGGAGCTTAAATAAAGAGTm 

-161 ACTGAAAGGGAGAAGTGAAAGTGGGAAATTCCTCTGAAT 



-31 OGTAGTAGGCGArACTCTTCGTOrrGTCAAC AI£AC£MSMS2^£ISCTSCAA 

73 M£HsaisaiEcmaAM^£ ass me m as isr ca& aag. 

121 CIS STS TSS CAA TTS ME SSS ASS STI 2M IAS TSS STS MS SAS ASS 



169 ATS AAS XTT GA£ ATS SSI SAS SAS ATT MS SAS STS SAS SAS US SAG. 

217 mG^G^GSGamffiM£mG^m£ECASMvM22ir 

265 GCT ATT TTC AGA CAA GAT USA TST ASS AST SSS TSS AAT SAS AST ATT 

313 SH SAS AAS STS STS SST AAI ST£ TAT SAT SAS MA MS SAT STS MS 

361 ASA STS STS SAA SAA MA STS SAS MA SM SAT ITS ASS AGS SSA AAA 

409 STS ATS ASS AST STS SAS STS AAA ASA TAT TAT SSS ASS ATT STS SAT 

457 TAS STS AAS SSS MS SAS IAS AST SAS TST SSS TSQ ASS ATA STS ASA 

505 GTS SAA ATS STA ASS AAS TIT TAS US ATT AAS ASA STT ASA SST TAS 

BglII (565)

553 STS SSA AAS TGAAGATCTCC T Ar^CTGTGCCTCTr^ 

615 TT'AACC AGCAGATGCTGTTTA AGTCACTGA T rcyTA ATGTACTGCATATGAA AGGACACTAGAAS 



-96 




AUG (1) 



PvuII (199)
680 ATTTTG AAATTTTTATTAAATTATGAG
745 TTATTTTTGGTGCAAAAGTCAACATGGCAGTTTT^ 
810 TTATAAAATTGCCAAGTACCTATTAGTTGTTCT^ 
875 CTGGCCCCTGCCTTTAAGGAATTTAAAATTCAAGAAAGCCATGATG 
940 CAATAAGGGGACCTGAACCTTATGGGGGAATAAATATGGCATGAACTGCTGTG 
1005 AAAAGGAAAGCTGGAGGGTCTGGAACTAAACCTGGGGTTCCCATTCC 

BalI (1099)

1070 ATTCTCTCATCATAAAGTTAGAATTGAGCTGGCCATCAGGAATAGCC^ 
1135 TTGTGTTCTCCCTAACCTTCCCCAGTTATT^ 
1200 TAATTATGTGCCCCCCACCATCCCTGCAA 

FIG. 15B 



